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(1) GENERAL INFORMATION: 
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(iv) CORRESPONDENCE ADDRESS : 

(A) ADDRESSEE: Howson and Howson 

(B) STREET: 501 Office Center Drive, Suite 210 

(C) CITY: Fort Washington 

(D) STATE: Pennsylvania 

(E) COUNTRY: USA 

(F) ZIP: 19034 

(V) COMPUTER READABLE FORM: 

(A) medium type: Floppy disk 

(B) computer: IBM PC compatible 
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(A) NAME: Bak, Mary E. 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1047 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1047 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
AAG AAT AAT GAT CAT GAT AAT CAT AAG GGG ACT GTT AAG AAT GCT GTT 48 

Lys Asn Asn Asp His Asp Asn His Lys Gly Thr val Lys Asn Ala val 
15 10 15 

GAT ATG GCA AAG GCC GCT GAG GAA GCT GCA AGT GCT GCA AGT GCT GCT 96 
Asp Met Ala Lys Ala Ala Glu Glu Ala Ala Ser Ala Ala Ser Ala Ala 
20 25 30 

ACT GGT AAT GCA GCG ATT GGG GAT GTT GTT AAG AAT AGT GGG GCA GCA 144 
Thr Gly Asn Ala Ala lie Gly Asp val val Lys Asn ser Gly Ala Ala 
35 40 45 

GCA AAA GGT GGT GAG GCG GCG AGT GTT AAT GGG ATT GCT AAG GGG ATA 192 

Ala Lys Gly Gly Glu Ala Ala Ser val Asn Gly lie Ala Lys Gly lie 
50 55 60 

AAG GGG ATT GTT GAT GCT GCT GGA AAG GCT GAT GCG AAG GAA GGG AAG 240 
Lys Gly lie val Asp Ala Ala Gly Lys Ala Asp Ala Lys Glu Gly Lys 
65 70 75 80 

TTG GAT GCT ACT GGT GCT GAG GGT ACG ACT AAC GTG AAT GCT GGG AAG 288 
Leu Asp Ala Thr Gly Ala Glu Gly Thr Thr Asn val Asn Ala Gly Lys 
85 90 95 

TTG TTT GTG AAG AGG GCG GCT GAT GAT GGT GGT GAT GCA GAT GAT GCT 336 

Leu Phe val Lys Arg Ala Ala Asp Asp Gly Gly Asp Ala Asp Asp Ala 
100 105 110 

GGG AAG GCT GCT GCT GCG GTT GCT GCA AGT GCT GCT ACT GGT AAT GCA 384 
Gly Lys Ala Ala Ala Ala Val Ala Ala Ser Ala Ala Thr Gly Asn Ala 
115 120 125 

GCG ATT GGA GAT GTT GTT AAT GGT GAT GTG GCA AAA GCA AAA GGT GGT 432 

Ala lie Gly Asp val val Asn Gly Asp val Ala Lys Ala Lys Gly Gly 
130 135 140 

GAT GCG GCG AGT GTT AAT GGG ATT GCT AAG GGT ATA AAG GGG ATT GTT 480 
Asp Ala Ala ser Val Asn Gly lie Ala Lys Gly lie Lys Gly lie val 
145 150 155 160 

GAT GCT GCT GAG AAG GCT GAT GCG AAG GAA GGG AAG TTG AAT GCT GCT 528 
Asp Ala Ala Glu Lys Ala Asp Ala Lys Glu Gly Lys Leu Asn Ala Ala 
165 170 175 

GGT GCT GAG GGT ACG ACT AAC GCG GAT GCT GGG AAG TTG TTT GTG AAG 576 

Gly Ala Glu Gly Thr Thr Asn Ala Asp Ala Gly Lys Leu Phe Val Lys 
180 185 190 

AAT GCT GGT AAT GTG GGT GGT GAA GCA GGT GAT GCT GGG AAG GCT GCT 624 
Asn Ala Gly Asn val Gly Gly Glu Ala Gly Asp Ala Gly Lys Ala Ala 
195 200 205 

GCT GCG GTT GCT GCT GTT AGT GGG GAG CAG ATA TTA AAA GCG ATT GTT 672 
Ala Ala val Ala Ala val ser Gly Glu Gin lie Leu Lys Ala lie val 
210 215 220 



CAT GCT GCT AAG GAT GGT GGT GAG AAG CAG GGT AAG AAG GCT GCG GAT 720 

i 



His Ala Ala Lys Asp Gly Gly Glu Lys Gin Gly Lys Lys Ala Ala Asp 
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225 230 235 240 

CGT ACA AAT CCC ATT GAC GCG GCT ATT GGG GGT GCG GGT GAT AAT GAT 768 
Arg Thr Asn Pro He Asp Ala Ala lie Gly Gly Ala Gly Asp Asn Asp 
245 250 255 

GCT GCT GCG GCG TTT GCT ACT ATG AAG AAG GAT GAT CAG ATT GCT GCT 816 
Ala Ala Ala Ala Phe Ala Thr Met Lys Lys Asp Asp Gin lie Ala Ala 
260 265 270 

GCT ATG GTT CTG AGG GGA ATG GCT AAG GAT GGG CAA TTT GCT TTG AAG 864 
Ala Met val Leu Arg Gly Met Ala Lys Asp Gly Gin Phe Ala Leu Lys 
275 280 285 

GAT GCT GCT GCT GCT CAT GAA GGG ACT GTT AAG AAT GCT GTT GAT ATA 912 

Asp Ala Ala Ala Ala His Glu Gly Thr val Lys Asn Ala val Asp lie 
290 295 300 

ATA AAG GCT GCT GCG GAA GCT GCA AGT GCT GCA AGT GCT GCT ACT GGT 960 
lie Lys Ala Ala Ala Glu Ala Ala Ser Ala Ala Ser Ala Ala Thr Gly 
305 310 315 320 

AGT GCA GCA ATT GGG GAT GTT GTT AAT GGT AAT GGA GCA ACA GCA AAA 1008 
Ser Ala Ala lie Gly Asp val val Asn Gly Asn Gly Ala Thr Ala Lys 
325 330 335 

GGT GGT GAT GCG AAG AGT GTT AAT GGC ATT GCT AAG GGA 1047 
Gly Gly Asp Ala Lys Ser val Asn Gly lie Ala Lys Gly 
340 345 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 349 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Lys Asn Asn Asp His Asp Asn His Lys Gly Thr val Lys Asn Ala val 
1 5 10 15 

Asp Met Ala Lys Ala Ala Glu Glu Ala Ala ser Ala Ala Ser Ala Ala 
20 25 30 

Thr Gly Asn Ala Ala lie Gly Asp Val Val Lys Asn Ser Gly Ala Ala 
35 40 45 

Ala Lys Gly Gly Glu Ala Ala ser val Asn Gly lie Ala Lys Gly lie 
50 55 60 

Lys Gly lie val Asp Ala Ala Gly Lys Ala Asp Ala Lys Glu Gly Lys 
65 70 75 80 

Leu Asp Ala Thr Gly Ala Glu Gly Thr Thr Asn Val Asn Ala Gly Lys 
85 90 95 

Leu Phe val Lys Arg Ala Ala Asp Asp Gly Gly Asp Ala Asp Asp Ala 
100 105 110 

Gly Lys Ala Ala Ala Ala val Ala Ala ser Ala Ala Thr Gly Asn Ala 
115 120 125 
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Ala lie Gly Asp val val Asn Gly Asp val Ala Lys Ala Lys Gly Gly 
130 135 140 

Asp Ala Ala ser val Asn Gly lie Ala Lys Gly lie Lys Gly lie val 
145 150 155 160 

Asp Ala Ala Glu Lys Ala Asp Ala Lys Glu Gly Lys Leu Asn Ala Ala 
165 170 175 

Gly Ala Glu Gly Thr Thr Asn Ala Asp Ala Gly Lys Leu Phe Val Lys 
180 185 190 

Asn Ala Gly Asn val Gly Gly Glu Ala Gly Asp Ala Gly Lys Ala Ala 
195 200 205 

Ala Ala val Ala Ala val ser Gly Glu Gin lie Leu Lys Ala lie val 
210 215 220 

His Ala Ala Lys Asp Gly Gly Glu Lys Gin Gly Lys Lys Ala Ala Asp 
225 230 235 240 

Arg Thr Asn Pro lie Asp Ala Ala lie Gly Gly Ala Gly Asp Asn Asp 
245 250 255 

Ala Ala Ala Ala Phe Ala Thr Met Lys Lys Asp Asp Gin lie Ala Ala 
260 265 270 

Ala Met val Leu Arg Gly Met Ala Lys Asp Gly Gin Phe Ala Leu Lys 
275 280 285 

Asp Ala Ala Ala Ala His Glu Gly Thr val Lys Asn Ala val Asp lie 
290 295 300 

lie Lys Ala Ala Ala Glu Ala Ala Ser Ala Ala Ser Ala Ala Thr Gly 
305 310 315 320 

Ser Ala Ala lie Gly Asp Val Val Asn Gly Asn Gly Ala Thr Ala Lys 
325 330 335 

Gly Gly Asp Ala Lys Ser Val Asn Gly lie Ala Lys Gly 
340 345 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) length: 283 base pairs 

(B) TYPE: nucleic acid 
(c) strandedness: double 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GCCGCTGGAT GGTGGTGAGA AGCAGGGTAA GAAGGCTGCG GATCGTACAA ATCCCATTGA 60 

CGCGGCTATT GGGGGTGCGG GTGATAATGA TGCTGCTGCG GCGTTTGCTA CTATGAAGAA 120 

GGATGATCAG ATTGCTGCTG CTATGGTTCT GAGGGGAATG GCTAAGGATG GGCAATTTGC 180 

TTTGAAGGAT GCTGCTGCTG CTCATGAAGG GACTGTTAAG AATG CTGTTG ATATAATAAA 240 
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GGCTGCTGCG GAAGCTGCAA GTGCTGCAAG TGCTGCTACT GGT 283 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
CD) topology: unknown 

(ii) MOLECULE type: dna (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

TTTATTATAT caacagattc ttaacagtcc cttcatgagc agcagcagca tccttcaaag 60 

CAAATTGCCC ATCCTTAGCC ATTCCCCTCA GAACCATAGC AGCAGCAATC TGATCATCCT 120 

TCTTCATAGT AGCAAACGCC GCAGCAGCAT CATTATCACC CGCACCCCCA ATAGCCGCGT 180 

CAATCGGATT TGTACGATCC GCAGCCTTCT TACCCTGCTT CTCACCACCA TCC 233 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) length: 194 base pairs 

(B) type: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE type: dna (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CCGTGCAAGC TGGGTTGAAG AAGGTTGGGG ATGTTGTTAA GAATAGTGAG GCAAAAGATG 60 

GTGATGCGGC GAGTGTTAAT GGGATTGCTA AGGGGATAAA GGGGATTGTT GATGCTGCTG 120 

AGAAGGCTGA TGCGAAGGAA GGGAAGTTGG TATGTGGCTG GTGCTGCTGG TGAAACTAAC 180 

AAGGAAGCGG CCGC 194 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 369 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) topology: unknown 

(ii) molecule type: dna (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GCGGCCGCTT GAGGAAGCTG CAAGTGCTGC AAGTGCTGCT ACTGGTAATG CAGCGATTGG 60 
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GGATGTTGTT AAGAATAGTG GGGCAGCAGC AAAAGGTGGT GAGGCGGCGA GTGTTAATGG 120 

GATTGCTAAG GGGATAAAGG GGATTGTTGA TGCTGCTGGA AAGGCTGATG CGAAGGAAGG 180 

GAAGTTGGAT GCTACTGGTG CTGAGGGTAC GACTAACGTG AATGCTGGGA AGTTGTTTGT 240 

GAAGAGGGCG GCTGATGATG GTGGTGATGC AGATGATGCT GGGAAGGCTG CTGCTGCGGT 300 

TGCTGCAAGT GCTGCTACTG GTAATGCAGC GATTGGAGAT GTTGTTAATG GTGATGTGGC 360 

AAAACAAAA 369 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) length: 142 base pairs 

(B) type: nucleic acid 
(c) strandedness: double 
(D) topology: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
AAGGATGGTG ATGATAAGCA GGGTAAGAAG GCTGAGGATG CTACAAATCC GATTGACGCG 60 
GCTATTGGGG GTGCAGGTGC GGGTGCTAAT GCTGCTGCGG CGTTTAATAA TATGAAGAAG 120 
GATGATCAGA TTGAGCGGCC GC 142 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 base pairs 

(B) TYPE: nucleic acid 

(C) strandedness: double 

(D) TOPOLOGY: unknown 

(ii) molecule type: dna (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GGTGAAACTA ACAAGGATGC TGGGAAGTTG TTTGTGAAGA AGAATGGTGA TGATGGTGGT 60 

GATGCAGGTG ATGCTGGGAA GGCTGCTGCT GCGGTTGCTG CTGTTAGTGG GGAGCAGATA 120 

TTAAAAGCGA TTGTTGATGC TG CTAAAG AT GGTGATAAGA CGGGGGTTAC TGATGTAAAG 180 

GATGCTACAA ATCCGATTGA CGCGGCTATT 210 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 236 base pairs 

(B) type: nucleic acid 
(c) strandedness: double 
(D) topology: unknown 
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(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

TATATAATAA AGGCTGCTGC GAAGCTGCAA GTGCTGCAAG TGCTGCTACT GGTAGTGCAG 60 

CAATTGGGGA TGTTGTTAAT GGTAATGGAG CAACAGCAAA AGGTGGTGAT GCGAAGTGTT 120 

AATGGGATTG CTAAGGGGAT AAAGGGGATT GTTGATGCTG CTGAGAAGGC TGATGCGAAG 180 

GAAGGGAAGT TGGATGTGGC TGGTGATGCT GGTGAAACTA ACAAGGAAGC GGCCGC 236 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) length: 199 base pairs 

(B) type: nucleic acid 
(c) strandedness: double 
(D) topology: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

ATGAGAGGAT CTCATCACCA TCACCATCAC ACGGATCCCC CGGGCTGCAG GAATTCGCGG 60 

CCGCTGAAGG CTGATGCGAA GGAAGGGAAG TTGGATGTGG CTGGTGCTGC TGGTGAAACT 120 

AACAAGGATG CTGGGAAGTT GTTTGTGAAG AAGAATAATG AGGGTGGTGA AGCAAATGAT 180 

GCTGGGAAGG CTGCTGCTG 199 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 272 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GCCGCTGGAT GATCAGATTG CTGCTGCTAT GGTTGTGAGG GGAATGGCTA AGGATGGGCA 60 
GTTTGCTTTG AAGGATGATG CTGCTAAGGA TGGAGATAAA ACGGGGGTTG CTGCGGATGT 120 
GAAAATCCGA TTGACGCGGC TATTGGGGGT GCGGATGCTG ATGCTGCGGC GTTTAATAAG 180 
GAGGGGATGA AGAAGGATGA TCAGATTGCT GCTGCTATGG TTCTGAGGGG AATGGCTAAG 240 
GATGGGCAGT TTGCTTTGAC GAATAATGCT GC 272 
(2) INFORMATION FOR SEQ ID NO: 12: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) length: 289 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

ACTGTTAAGA ATGCTGTTGA TATAATAAAG GCTGCTGCGG AAGCTGCAAG TGCTGCAAGT 60 

GCTGCTACTG GTAGTGCAGC AATTGGGGAT GTTGTTAATG GTAATGGAGC AACAGCAAAA 120 

GGTGGTGATG CGAAGAGTGT TAATGGGATT GCTAAGGGGA TAAAGGGGAT TGTTGATGCT 180 

GCTGAGAAGG CTGATGCGAA GGAAGGGAAG TTGGATGTGG CTGGTGATGC TGGTGAAACT 240 

AACAAGGATG CTGGGAAGTT GTTTGTGAAG AACAATGGTA ATGAGGGTA 289 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 142 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2.. 142 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 2 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

G CCG CTT ACA AAT CCG ATT GAC GCG GCT ATT GGG GGG AGT GCG GAT 46 
Pro Leu Thr Asn Pro lie Asp Ala Ala lie Gly Gly Ser Ala Asp 
1 5 10 15 

CGT AAT GCT GAG GCG TTT GAT AAG ATG AAG AAG GAT GAT CAG ATT GCT 94 
Arg Asn Ala Glu Ala Phe Asp Lys Met Lys Lys Asp Asp Gin lie Ala 
20 25 30 

GCT GCT ATG GTT CTG AGG GGA ATG GCT AAG GAT GGG CAG TTT GCT TTG 142 

Ala Ala Met val Leu Arg Gly Met Ala Lys Asp Gly Gin Phe Ala Leu 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) type: amino acid 
(D) TOPOLOGY: linear 



Page 8 



(ii) molecule type: protein 

(xi) SEQUENCE description: seq id NO: 14: 

pro Leu Thr Asn Pro lie Asp Ala Ala lie Gly Gly Ser Ala Asp Arg 
1 5 10 15 

Asn Ala Glu Ala Phe Asp Lys Met Lys Lys Asp Asp Gin lie Ala Ala 
20 25 30 

Ala Met val Leu Arg Gly Met Ala Lys Asp Gly Gin Phe Ala Leu 
35 ~ 40 45 
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